[Speculative Decoding] Support draft model on different tensor-parallel size than target model #5414
+389
−59